Predicting Crime Reporting with Decision Trees and the National Crime Victimization Survey

Authors

  • Juliette Gutierrez
  • Gondy Leroy
Abstract

Crime reports are used by law enforcement to find criminals, prevent further violations, identify problems causing crimes, and allocate government resources. Unfortunately, many crimes go unreported, which may lead to an inaccurate picture of crime and suboptimal responses to the existing situation. Our goal is to use a data mining approach to better understand when crime is and is not reported. A better understanding could lead to new, more effective programs to fight crime or to changes in existing programs. We use the National Crime Victimization Survey (NCVS), which comprises data collected from 45,000 households about incidents, victims, suspects, and whether the incident was reported. We use decision trees to predict whether incidents are reported. We compare decision trees built from domain knowledge with those created automatically. For the automatically created trees, we compare three variable selection methods: two filters, Chi-squared and Cramér's V coefficient, and a forward selection wrapper. We found that the automatically constructed decision trees are as accurate as those based on domain knowledge, although they show a different picture. We conclude that decision trees suggest several new hypotheses for criminologists and that, because they are constructed automatically and are easy to understand, they are practical and useful.
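
The two filter methods named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the data below is a toy example rather than NCVS data, and the variable names (`weapon_present`, `daytime`, `reported`) and the helper `rank_features` are hypothetical. It assumes categorical predictors and ranks them by the strength of their association with the reporting outcome, using the Pearson chi-squared statistic and Cramér's V.

```python
from collections import Counter
from math import sqrt


def chi_squared(xs, ys):
    """Pearson chi-squared statistic for two categorical sequences."""
    n = len(xs)
    row_totals = Counter(xs)
    col_totals = Counter(ys)
    observed = Counter(zip(xs, ys))
    stat = 0.0
    for r, r_count in row_totals.items():
        for c, c_count in col_totals.items():
            expected = r_count * c_count / n
            stat += (observed[(r, c)] - expected) ** 2 / expected
    return stat


def cramers_v(xs, ys):
    """Cramér's V: chi-squared rescaled to the range [0, 1]."""
    n = len(xs)
    k = min(len(set(xs)), len(set(ys))) - 1
    if k == 0:  # a constant variable carries no association
        return 0.0
    return sqrt(chi_squared(xs, ys) / (n * k))


def rank_features(features, target):
    """Hypothetical filter step: sort feature names by Cramér's V, strongest first."""
    scores = {name: cramers_v(values, target) for name, values in features.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


# Toy data (NOT from the NCVS): 1 = yes, 0 = no.
reported = [1, 1, 1, 1, 0, 0, 0, 0]
features = {
    "daytime": [1, 0, 1, 0, 1, 0, 1, 0],         # unrelated to reporting
    "weapon_present": [1, 1, 1, 1, 0, 0, 0, 0],  # perfectly associated
}

ranking = rank_features(features, reported)
print(ranking)
```

A filter like this scores each variable independently of the classifier. A forward selection wrapper, by contrast, would repeatedly train the decision tree itself, adding whichever variable most improves its accuracy at each step.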

Similar resources

Running head: USING DECISION TREES TO PREDICT CRIME REPORTING

Crime reports are used to find criminals, prevent further violations, identify problems causing crimes and allocate government resources. Unfortunately, many crimes go unreported. The National Crime Victimization Survey (NCVS) comprises data about incidents, victims, suspects and if the incident was reported or not. Current research using the NCVS is limited to statistical techniques resulting ...

Predicting Violent Crime Rates for the 2010 Redesign of the National Crime Victimization Survey (NCVS)

The National Crime Victimization Survey (NCVS) is a major crime survey for the United States. The survey collects data on several types of crimes, including the broad categories of violent crime and property crime. The 2010 redesign of the NCVS can potentially improve the efficiency of the survey if the level of crime can be predicted well by external data. Previously, we reported initial succe...

Characteristics of Crimes Against Juveniles

The FBI’s Uniform Crime Reporting (UCR) system and the Bureau of Justice Statistics’ National Crime Victimization Survey do not collect information about crimes committed against persons under 12 years of age and thus do not provide a comprehensive picture of juvenile crime victimization. Designed to replace UCR as the national database for crimes reported to law enforcement, the FBI’s National...

Reporting Crime Victimizations to the Police and the Incidence of Future Victimizations: A Longitudinal Study

BACKGROUND Law enforcement depends on cooperation from the public and crime victims to protect citizens and maintain public safety; however, many crimes are not reported to police because of fear of repercussions or because the crime is considered trivial. It is unclear how police reporting affects the incidence of future victimization. OBJECTIVE To evaluate the association between reporting ...

A Bayesian Belief Network Classifier for Predicting Victimization in National Crime Victimization Survey

This paper presents the development of a Bayes net classifier for prediction of a victimization attribute value for the National Crime Victimization Survey dataset. The National Crime Victimization Survey dataset has over 250 attributes and 216,000 data points, and as such poses a large-scale problem context for classifier development. The classifier was developed using the Weka machine learnin...

Publication date: 2007